Exploring the use of Machine Translation resources for English-Japanese Cross-Language Information Retrieval
نویسندگان
چکیده
This paper describes a detailed investigation into the use of Machine Translation techniques for query translation in English-Japanese Cross-Language Information Retrieval (CLIR). Experimental results are reported for the standard BMIR-J2 Japanese text retrieval collection extended to a CLIR task. Results indicate that retrieval performance increases as greater linguistic processing is used in the translation process, and that local feedback via term reweighting and query expansion is e ective for this task.
منابع مشابه
Indonesian-Japanese CLIR Using Only Limited Resource
Our research aim here is to build a CLIR system that works for a language pair with poor resources where the source language (e.g. Indonesian) has limited language resources. Our IndonesianJapanese CLIR system employs the existing Japanese IR system, and we focus our research on the IndonesianJapanese query translation. There are two problems in our limited resource query translation: the OOV p...
متن کاملColing • Acl 2006
Our research aim here is to build a CLIR system that works for a language pair with poor resources where the source language (e.g. Indonesian) has limited language resources. Our IndonesianJapanese CLIR system employs the existing Japanese IR system, and we focus our research on the IndonesianJapanese query translation. There are two problems in our limited resource query translation: the OOV p...
متن کاملComparing Multiple Methods for Japanese and Japanese-English Text Retrieval
The NACSIS collection of Japanese scienti c documents (with English titles) provides a solid foundation for information retrieval research into 1) segmentation methods for Japanese text, 2) e ective methods for monolingual Japanese retrieval, and 3) JapaneseEnglish cross-language retrieval. This paper compares multiple methods for Japanese and Japanese-English text retrieval. Our focus is on ac...
متن کاملCMU in Cross-Language Information Retrieval at NTCIR-3
We participated in the Cross-Language Information Retrieval evaluation at NTCIR-3 for the EnglishChinese and English-Japanese tasks. We examined several approaches to query translation, including the use of a commercial machine translation system, a thesaurus that is automatically extracted from a parallel corpus, and a general-purpose online dictionary. The MT-based approach was most effective...
متن کاملHow Similar are Chinese and Japanese for Cross-Language Information Retrieval?
For NTCIR Workshop 5 UC Berkeley participated in the bilingual task of the CLIR track. Our focus was on Chinese topic searches against the Japanese News document collection, and on Japanese topic search against the Chinese News Document Collection. Extending our work of NTCIR 4 workshop, we performed search experiments to segment and use Chinese search topics directly as if they were Japanese t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999